56 resultados para GENOMIC SEQUENCE

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The intensely studied MHC has become the paradigm for understanding the architectural evolution of vertebrate multigene families. The 4-Mb human MHC (also known as the HLA complex) encodes genes critically involved in the immune response, graft rejection, and disease susceptibility. Here we report the continuous 1,796,938-bp genomic sequence of the HLA class I region, linking genes between MICB and HLA-F. A total of 127 genes or potentially coding sequences were recognized within the analyzed sequence, establishing a high gene density of one per every 14.1 kb. The identification of 758 microsatellite provides tools for high-resolution mapping of HLA class I-associated disease genes. Most importantly, we establish that the repeated duplication and subsequent diversification of a minimal building block, MIC-HCGIX-3.8–1-P5-HCGIV-HLA class I-HCGII, engendered the present-day MHC. That the currently nonessential HLA-F and MICE genes have acted as progenitors to today’s immune-competent HLA-ABC and MICA/B genes provides experimental evidence for evolution by “birth and death,” which has general relevance to our understanding of the evolutionary forces driving vertebrate multigene families.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

We examined the MLL genomic translocation breakpoint in acute myeloid leukemia of infant twins. Southern blot analysis in both cases showed two identical MLL gene rearrangements indicating chromosomal translocation. The rearrangements were detectable in the second twin before signs of clinical disease and the intensity relative to the normal fragment indicated that the translocation was not constitutional. Fluorescence in situ hybridization with an MLL-specific probe and karyotype analyses suggested t(11;22)(q23;q11.2) disrupting MLL. Known 5′ sequence from MLL but unknown 3′ sequence from chromosome band 22q11.2 formed the breakpoint junction on the der(11) chromosome. We used panhandle variant PCR to clone the translocation breakpoint. By ligating a single-stranded oligonucleotide that was homologous to known 5′ MLL genomic sequence to the 5′ ends of BamHI-digested DNA through a bridging oligonucleotide, we formed the stem–loop template for panhandle variant PCR which yielded products of 3.9 kb. The MLL genomic breakpoint was in intron 7. The sequence of the partner DNA from band 22q11.2 was identical to the hCDCrel (human cell division cycle related) gene that maps to the region commonly deleted in DiGeorge and velocardiofacial syndromes. Both MLL and hCDCrel contained homologous CT, TTTGTG, and GAA sequences within a few base pairs of their respective breakpoints, which may have been important in uniting these two genes by translocation. Reverse transcriptase-PCR amplified an in-frame fusion of MLL exon 7 to hCDCrel exon 3, indicating that an MLL-hCDCrel chimeric mRNA had been transcribed. Panhandle variant PCR is a powerful strategy for cloning translocation breakpoints where the partner gene is undetermined. This application of the method identified a region of chromosome band 22q11.2 involved in both leukemia and a constitutional disorder.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

There is a need for faster and more sensitive algorithms for sequence similarity searching in view of the rapidly increasing amounts of genomic sequence data available. Parallel processing capabilities in the form of the single instruction, multiple data (SIMD) technology are now available in common microprocessors and enable a single microprocessor to perform many operations in parallel. The ParAlign algorithm has been specifically designed to take advantage of this technology. The new algorithm initially exploits parallelism to perform a very rapid computation of the exact optimal ungapped alignment score for all diagonals in the alignment matrix. Then, a novel heuristic is employed to compute an approximate score of a gapped alignment by combining the scores of several diagonals. This approximate score is used to select the most interesting database sequences for a subsequent Smith–Waterman alignment, which is also parallelised. The resulting method represents a substantial improvement compared to existing heuristics. The sensitivity and specificity of ParAlign was found to be as good as Smith–Waterman implementations when the same method for computing the statistical significance of the matches was used. In terms of speed, only the significantly less sensitive NCBI BLAST 2 program was found to outperform the new approach. Online searches are available at http://dna.uio.no/search/

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The TEL (ETV6)−AML1 (CBFA2) gene fusion is the most common reciprocal chromosomal rearrangement in childhood cancer occurring in ≈25% of the most predominant subtype of leukemia— common acute lymphoblastic leukemia. The TEL-AML1 genomic sequence has been characterized in a pair of monozygotic twins diagnosed at ages 3 years, 6 months and 4 years, 10 months with common acute lymphoblastic leukemia. The twin leukemic DNA shared the same unique (or clonotypic) but nonconstitutive TEL-AML1 fusion sequence. The most plausible explanation for this finding is a single cell origin of the TEL-AML fusion in one fetus in utero, probably as a leukemia-initiating mutation, followed by intraplacental metastasis of clonal progeny to the other twin. Clonal identity is further supported by the finding that the leukemic cells in the two twins shared an identical rearranged IGH allele. These data have implications for the etiology and natural history of childhood leukemia.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called pompous (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p21.3, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by pompous to be variable in repeat size. We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42. Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. pompous provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use pompous to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The Chinese hamster ovary (CHO) mutant UV40 cell line is hypersensitive to UV and ionizing radiation, simple alkylating agents, and DNA cross-linking agents. The mutant cells also have a high level of spontaneous chromosomal aberrations and 3-fold elevated sister chromatid exchange. We cloned and sequenced a human cDNA, designated XRCC9, that partially corrected the hypersensitivity of UV40 to mitomycin C, cisplatin, ethyl methanesulfonate, UV, and γ-radiation. The spontaneous chromosomal aberrations in XRCC9 cDNA transformants were almost fully corrected whereas sister chromatid exchanges were unchanged. The XRCC9 genomic sequence was cloned and mapped to chromosome 9p13. The translated XRCC9 sequence of 622 amino acids has no similarity with known proteins. The 2.5-kb XRCC9 mRNA seen in the parental cells was undetectable in UV40 cells. The mRNA levels in testis were up to 10-fold higher compared with other human tissues and up to 100-fold higher compared with other baboon tissues. XRCC9 is a candidate tumor suppressor gene that might operate in a postreplication repair or a cell cycle checkpoint function.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Transposon mutagenesis provides a direct selection for mutants and is an extremely powerful technique to analyze genetic functions in a variety of prokaryotes. Transposon mutagenesis of Mycobacterium tuberculosis has been limited in part because of the inefficiency of the delivery systems. This report describes the development of conditionally replicating shuttle phasmids from the mycobacteriophages D29 and TM4 that enable efficient delivery of transposons into both fast- and slow-growing mycobacteria. These shuttle phasmids consist of an Escherichia coli cosmid vector containing either a mini-Tn10(kan) or Tn5367 inserted into a nonessential region of the phage genome. Thermosensitive mutations were created in the mycobacteriophage genome that allow replication at 30°C but not at 37°C (TM4) or 38.5°C (D29). Infection of mycobacteria at the nonpermissive temperature results in highly efficient transposon delivery to the entire population of mycobacterial cells. Transposition of mini-Tn10(kan) occurred in a site-specific fashion in M. smegmatis whereas Tn5367 transposed apparently randomly in M. phlei, Bacille Calmette–Guérin (BCG), and M. tuberculosis. Sequence analysis of the M. tuberculosis and BCG chromosomal regions adjacent to Tn5367 insertions, in combination with M. tuberculosis genomic sequence and physical map data, indicates that the transpositions have occurred randomly in diverse genes in every quadrant of the genome. Using this system, it has been readily possible to generate libraries containing thousands of independent mutants of M. phlei, BCG, and M. tuberculosis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Asparaginyl-tRNA (Asn-tRNA) and glutaminyl-tRNA (Gln-tRNA) are essential components of protein synthesis. They can be formed by direct acylation by asparaginyl-tRNA synthetase (AsnRS) or glutaminyl-tRNA synthetase (GlnRS). The alternative route involves transamidation of incorrectly charged tRNA. Examination of the preliminary genomic sequence of the radiation-resistant bacterium Deinococcus radiodurans suggests the presence of both direct and indirect routes of Asn-tRNA and Gln-tRNA formation. Biochemical experiments demonstrate the presence of AsnRS and GlnRS, as well as glutamyl-tRNA synthetase (GluRS), a discriminating and a nondiscriminating aspartyl-tRNA synthetase (AspRS). Moreover, both Gln-tRNA and Asn-tRNA transamidation activities are present. Surprisingly, they are catalyzed by a single enzyme encoded by three ORFs orthologous to Bacillus subtilis gatCAB. However, the transamidation route to Gln-tRNA formation is idled by the inability of the discriminating D. radiodurans GluRS to produce the required mischarged Glu-tRNAGln substrate. The presence of apparently redundant complete routes to Asn-tRNA formation, combined with the absence from the D. radiodurans genome of genes encoding tRNA-independent asparagine synthetase and the lack of this enzyme in D. radiodurans extracts, suggests that the gatCAB genes may be responsible for biosynthesis of asparagine in this asparagine prototroph.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The isolation and study of Anopheles gambiae genes that are differentially expressed in development, notably in tissues associated with the maturation and transmission of the malaria parasite, is important for the elucidation of basic molecular mechanisms underlying vector–parasite interactions. We have used the differential display technique to screen for mRNAs specifically expressed in adult males, females, and midgut tissues of blood-fed and unfed females. We also screened for mRNAs specifically induced upon bacterial infection of larval stage mosquitoes. We have characterized 19 distinct cDNAs, most of which show developmentally regulated expression specificity during the mosquito life cycle. The most interesting are six new sequences that are midgut-specific in the adult, three of which are also modulated by blood-feeding. The gut-specific sequences encode a maltase, a V-ATPase subunit, a GTP binding protein, two different lectins, and a nontrypsin serine protease. The latter sequence is also induced in larvae subjected to bacterial challenge. With the exception of a mitochondrial DNA fragment, the other 18 sequences constitute expressed genomic sequence tags, 4 of which have been mapped cytogenetically.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Caenorhabditis elegans should soon be the first multicellular organism whose complete genomic sequence has been determined. This achievement provides a unique opportunity for a comprehensive assessment of the signal transduction molecules required for the existence of a multicellular animal. Although the worm C. elegans may not much resemble humans, the molecules that regulate signal transduction in these two organisms prove to be quite similar. We focus here on the content and diversity of protein kinases present in worms, together with an assessment of other classes of proteins that regulate protein phosphorylation. By systematic analysis of the 19,099 predicted C. elegans proteins, and thorough analysis of the finished and unfinished genomic sequences, we have identified 411 full length protein kinases and 21 partial kinase fragments. We also describe 82 additional proteins that are predicted to be structurally similar to conventional protein kinases even though they share minimal primary sequence identity. Finally, the richness of phosphorylation-dependent signaling pathways in worms is further supported with the identification of 185 protein phosphatases and 128 phosphoprotein-binding domains (SH2, PTB, STYX, SBF, 14-3-3, FHA, and WW) in the worm genome.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In view of the well-established role of neurohypophysial hormones in osmoregulation of terrestrial vertebrates, lungfishes are a key group for study of the molecular and functional evolution of the hypothalamo-neurohypophysial system. Here we report on the primary structure of the precursors encoding vasotocin (VT) and [Phe2]mesotocin ([Phe2]MT) of the Australian lungfish, Neoceratodus forsteri. Genomic sequence analysis and Northern blot analysis confirmed that [Phe2]MT is a native oxytocin family peptide in the Australian lungfish, although it has been reported that the lungfish neurohypophysis contains MT. The VT precursor consists of a signal peptide, VT, that is connected to a neurophysin by a Gly-Lys-Arg sequence, and a copeptin moiety that includes a Leu-rich core segment and a glycosylation site. In contrast, the [Phe2]MT precursor does not contain a copeptin moiety. These structural features of the lungfish precursors are consistent with those in tetrapods, but different from those in teleosts where both VT and isotocin precursors contain a copeptin-like moiety without a glycosylation site at the carboxyl terminals of their neurophysins. Comparison of the exon/intron organization also supports homology of the lungfish [Phe2]MT gene with tetrapod oxytocin/MT genes, rather than with teleost isotocin genes. Moreover, molecular phylogenetic analysis shows that neurohypophysial hormone genes of the lungfish are closely related to those of the toad. The present results along with previous morphological findings indicate that the hypothalamo-neurohypophysial system of the lungfish has evolved along the tetrapod lineage, whereas the teleosts form a separate lineage, both within the class Osteichthyes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

DNA methylation is an important regulator of genetic information in species ranging from bacteria to humans. DNA methylation appears to be critical for mammalian development because mice nullizygous for a targeted disruption of the DNMT1 DNA methyltransferase die at an early embryonic stage. No DNA methyltransferase mutations have been reported in humans until now. We describe here the first example of naturally occurring mutations in a mammalian DNA methyltransferase gene. These mutations occur in patients with a rare autosomal recessive disorder, which is termed the ICF syndrome, for immunodeficiency, centromeric instability, and facial anomalies. Centromeric instability of chromosomes 1, 9, and 16 is associated with abnormal hypomethylation of CpG sites in their pericentromeric satellite regions. We are able to complement this hypomethylation defect by somatic cell fusion to Chinese hamster ovary cells, suggesting that the ICF gene is conserved in the hamster and promotes de novo methylation. ICF has been localized to a 9-centimorgan region of chromosome 20 by homozygosity mapping. By searching for homologies to known DNA methyltransferases, we identified a genomic sequence in the ICF region that contains the homologue of the mouse Dnmt3b methyltransferase gene. Using the human sequence to screen ICF kindreds, we discovered mutations in four patients from three families. Mutations include two missense substitutions and a 3-aa insertion resulting from the creation of a novel 3′ splice acceptor. None of the mutations were found in over 200 normal chromosomes. We conclude that mutations in the DNMT3B are responsible for the ICF syndrome.